Coordination of multiple behaviors acquired by a vision-based reinforcement learning

نویسندگان

  • Minoru Asada
  • Eiji Uchibe
  • Shoichi Noda
  • Sukoya Tawaratsumida
  • Koh Hosoda
چکیده

A method is proposed which accomplishes a whole task consisting of plural subtasks by coordinating multiple behaviors acquired by a vision-based reinforcement learning. First, individual behaviors which achieve the corresponding subtasks are independently acquired by Q-learning, a widely used reinforcement learning method. Each learned behavior can be represented by an action-value function in terms of state of the environment and robot action. Next, three kinds of coordinations of multiple behaviors are considered; simple summation of di erent action-value functions, switching action-value functions according to situations, and learning with previously obtained actionvalue functions as initial values of a new action-value function. A task of shooting a ball into the goal avoiding collisions with an enemy is examined. The task can be decomposed into a ball shooting subtask and a collision avoiding subtask. These subtasks should be accomplished simultaneously, but they are not independent of each other. Three kinds of coordinations are compared with each other by computer simulations and our on-going real experiments are explained.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Behavior coordination for a mobile robot using modular reinforcement learning

Coordination of multiple behaviors independently obtained by a reinforcement learning method is one of the issues in order for the method to be scaled to larger and more complex robot learning tasks. Direct combination of all the state spaces for individual modules (subtasks) needs enormous learning time, and it causes hidden states. This paper presents a method of modular learning which coordi...

متن کامل

Incremental acquisition of behaviors and signs based on a reinforcement learning schemata model and a spike timing-dependent plasticity network

A novel integrative learning architecture based on a reinforcement learning schemata model (RLSM) with a spike timing-dependent plasticity (STDP) network is described. This architecture models operant conditioning with discriminative stimuli in an autonomous agent engaged in multiple reinforcement learning tasks. The architecture consists of two constitutional learning architectures: RLSM and S...

متن کامل

Hybrid coordination of reinforcement learning-based behaviors for AUV control

This paper proposes a Hybrid Coordination method for Behavior-based Control Architectures. The hybrid method takes in advantages of the robustness and modularity in competitive approaches as well as optimized trajectories in cooperative ones. This paper shows the feasibility of this hybrid method with a 3D-navigation application to an Autonomous Underwater Vehicle (AUV). The behaviors were lear...

متن کامل

Cooperative Behavior Acquisition for Moble Robots in Dynamically Changing Real Worlds via Vision-Based Reinforcement Learning and Development

A visioin-based reinforcement learning method is proposed to acquire cooperative behaviors in dynamic environments in which each agent works with his/her teammates to achieve the common goal against opponents. The method estimates the relationships between learner's behaviors and other agents' ones in the environment through interactions (observation and action) using the method of system ident...

متن کامل

A Vision-Based Reinforcement Learning For Coordination Of Soccer Playing Behaviors

A method is proposed which acquires a purposive behavior of shooting a ball into the goal avoiding collisions with an enemy. In [ Asada et al., 1994 ] , we have presented the soccer robot which learned to shoot a ball into the goal without any enemy, using the Q-learning, one of the reinforcement learning methods. Since a simple extension of the method is not practical due to its huge state spa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994